Growing Self Organising Map Based Exploratory Analysis of Text Data
نویسندگان
چکیده
Textual data plays an important role in the modern world. The possibilities of applying data mining techniques to uncover hidden information present in large volumes of text collections is immense. The Growing Self Organizing Map (GSOM) is a highly successful member of the Self Organising Map family and has been used as a clustering and visualisation tool across wide range of disciplines to discover hidden patterns present in the data. A comprehensive analysis of the GSOM’s capabilities as a text clustering and visualisation tool has so far not been published. These functionalities, namely map visualisation capabilities, automatic cluster identification and hierarchical clustering capabilities are presented in this paper and are further demonstrated with experiments on a benchmark text corpus. Keywords—Text Clustering, Growing Self Organizing Map, Automatic Cluster Identification, Hierarchical Clustering.
منابع مشابه
A Self-Organising Hybrid Model for Dynamic Text Clustering
A text clustering neural model, traditionally, is assumed to cluster static text information and represent its inner structure on a flat map. However, the quantity of text information is continuously growing and the relationships between them are usually complicated. Therefore, the information is not static and a flat map may be not enough to describe the relationships of input data. In this pa...
متن کاملA novel self-organising clustering model for time-event documents
Purpose Neural document clustering techniques, e.g., self-organising map (SOM) or growing neural gas (GNG), usually assume that textual information is stationary on the quantity. However, the quantity of text is ever-increasing. We propose a novel dynamic adaptive self-organising hybrid (DASH) model, which adapts to time-event news collections not only to the neural topological structure but al...
متن کاملNeural Networks: an Exploratory Data Analysis of Logistics Performance
Neural networks are a data processing technique that provides us a powerful tool to handle non-linear data and model complex relationships between data. Self-organising maps, a type of neural networks, has been used successfully as an exploratory data analysis method in applications like presenting the welfare states of the countries or analysing and representing financial data. Logistics inclu...
متن کاملSelf-Organising Maps for Hierarchical Tree View Document Clustering Using Contextual Information
In this paper we propose an effective method to cluster documents into a dynamically built taxonomy of topics, directly extracted from the documents. We take into account short contextual information within the text corpus, which is weighted by importance and used as input to a set of independently spun growing Self-Organising Maps (SOM). This work shows an increase in precision and labelling q...
متن کاملSelf-Organising Maps in Document Classification: A Comparison with Six Machine Learning Methods
This paper focuses on the use of self-organising maps, also known as Kohonen maps, for the classification task of text documents. The aim is to effectively and automatically classify documents to separate classes based on their topics. The classification with self-organising map was tested with three data sets and the results were then compared to those of six well known baseline methods: k-mea...
متن کامل